NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Split Computing With Scalable Feature Compression for Visual Analytics on the Edge

https://doi.org/10.1109/TMM.2024.3406165

Yuan, Zhongzheng; Rawlekar, Samyak; Garg, Siddharth; Erkip, Elza; Wang, Yao (May 2024, IEEE Transactions on Multimedia)

Full Text Available
Scalable Feature Compression for Edge-Assisted Object Detection Over Time-Varying Networks

Yuan, Zhongzheng (May 2023, In MLSys Workshop on Resource-Constrained Learning in Wireless Networks)

Split-computing has recently emerged as a paradigm for offloading computation of visual analytics models from low-powered mobile devices to edge or cloud servers, by which the mobiles execute part of the model and compress and send the intermediate features, and the servers complete the remaining model computation. Prior feature compression approaches train different compression models and possibly visual analytics models to reach different target bit rates. We propose a scalable compression model that compresses the intermediate features of the YOLO object detection model into a layered bitstream, which can be easily adapted to meet the rate constraint of a dynamic network. Our approach achieves comparable rate-accuracy performance compared to prior non-scalable compression approaches over a large bitrate range.
more » « less
Full Text Available
Feature Compression for Rate Constrained Object Detection on the Edge

Yuan, Zhongzheng; Samyak Rawlekar; Siddharth Garg; Elza Erkip; Yao Wang (August 2022, In MLSys 2023 Workshop on Resource-Constrained Learning in Wireless Networks)

Recent advances in computer vision has led to a growth of interest in deploying visual analytics model on mobile devices. However, most mobile devices have limited computing power, which prohibits them from running large scale visual analytics neural networks. An emerging approach to solve this problem is to offload the computation of these neural networks to computing resources at an edge server. Efficient computation offloading requires optimizing the trade-off between multiple objectives including compressed data rate, analytics performance, and computation speed. In this work, we consider a “split computation” system to offload a part of the computation of the YOLO object detection model. We propose a learnable feature compression approach to compress the intermediate YOLO features with lightweight computation. We train the feature compression and decompression module together with the YOLO model to optimize the object detection accuracy under a rate constraint. Compared to baseline methods that apply either standard image compression or learned image compression at the mobile and perform image de-compression and YOLO at the edge, the proposed system achieves higher detection accuracy at the low to medium rate range. Furthermore, the proposed system requires substantially lower computation time on the mobile device with CPU only.
more » « less
Full Text Available
Feature Compression for Rate Constrained Object Detection on the Edge

https://doi.org/10.1109/MIPR54900.2022.00008

Yuan, Zhongzheng; Rawlekar, Samyak; Garg, Siddharth; Erkip, Elza; Wang, Yao (August 2022, 2022 IEEE 5th International Conference on Multimedia Information Processing and Retrieval (MIPR))

Full Text Available
Network-Aware 5G Edge Computing for Object Detection: Augmenting Wearables to “See” More, Farther and Faster

https://doi.org/10.1109/ACCESS.2022.3157876

Yuan, Zhongzheng; Azzino, Tommy; Hao, Yu; Lyu, Yixuan; Pei, Haoyang; Boldini, Alain; Mezzavilla, Marco; Beheshti, Mahya; Porfiri, Maurizio; Hudson, Todd E.; et al (January 2022, IEEE Access)

Search for: All records